Picture for Jonas Fischer

Jonas Fischer

Max Planck Institute for Informatics

Interpretability Without Tradeoffs: Disentangling Polysemanticity At Equal Predictive Performance

Add code
May 29, 2026
Viaarxiv icon

Seeing Through Circuits: Faithful Mechanistic Interpretability for Vision Transformers

Add code
Apr 15, 2026
Viaarxiv icon

Certified Circuits: Stability Guarantees for Mechanistic Circuits

Add code
Feb 26, 2026
Viaarxiv icon

Insight: Interpretable Semantic Hierarchies in Vision-Language Encoders

Add code
Jan 20, 2026
Viaarxiv icon

Temporal Concept Dynamics in Diffusion Models via Prompt-Conditioned Interventions

Add code
Dec 09, 2025
Viaarxiv icon

Disentangling Polysemantic Channels in Convolutional Neural Networks

Add code
Apr 17, 2025
Viaarxiv icon

VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow

Add code
Mar 28, 2025
Figure 1 for VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow
Figure 2 for VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow
Figure 3 for VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow
Figure 4 for VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow
Viaarxiv icon

Escaping Plato's Cave: Robust Conceptual Reasoning through Interpretable 3D Neural Object Volumes

Add code
Mar 17, 2025
Viaarxiv icon

Unlocking Open-Set Language Accessibility in Vision Models

Add code
Mar 14, 2025
Figure 1 for Unlocking Open-Set Language Accessibility in Vision Models
Figure 2 for Unlocking Open-Set Language Accessibility in Vision Models
Figure 3 for Unlocking Open-Set Language Accessibility in Vision Models
Figure 4 for Unlocking Open-Set Language Accessibility in Vision Models
Viaarxiv icon

Now you see me! A framework for obtaining class-relevant saliency maps

Add code
Mar 10, 2025
Figure 1 for Now you see me! A framework for obtaining class-relevant saliency maps
Figure 2 for Now you see me! A framework for obtaining class-relevant saliency maps
Figure 3 for Now you see me! A framework for obtaining class-relevant saliency maps
Figure 4 for Now you see me! A framework for obtaining class-relevant saliency maps
Viaarxiv icon